1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC
151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA
1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC
1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT
1401 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG
1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA
1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT
1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTaTTG
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA
2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG
2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT
FIGURE 10

1 MEVDVDNQAD NHGIHSALKT CEEIKJEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARJC EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERXQKJ ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LERCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKJVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDDEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNTVIDSKN 
651 FRKKEMKDYF RQICFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAJCQ 
701 RNYFKKDNLL QPVrNICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKPAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLR1EGI LCTLNLNMQT KJCASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSIOIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 




FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG
51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA cattactaat gtttaaataa
351 aatcaggtaa tgaggattat tctatttttt agatcacttc ttaaggagca
401 ttatggagaa aattacttaa tactaaaagg taaacagttt ggattatttc
451 cctagccaac aatgatgagt atattaaatt catatgagaa tgagtcaaag
501 gatctcgata catcagactt accaaagaca aactcgctat aaaacgcaag
551 aaaaagtttg ataatcgaac agcagaagaa cttattgcat ttactattcg
601 tatgggtttt attacaattg ttttaggtat cgacggtgaa ctcccgagtc
651 ttgagacaat tgaaaaagct gtttacaact gaaggaatcg cagttctgaa
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA
1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT
1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC
1351 AAGAAATGGA gccgaaatct taatcaaaaa gaattgcgtc GATATTGCAA
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG aAGATCATTA aagaaataaa
1501 gtaactttta ttaattagag aataaactaa attactaata tagagatcag
1551 CGATCTTCAA ttgacgaaat aaaagctgaa ctaaagttag acaataaaaa
1601 ATACAAACCT tggtcaaaat attgaggaag gaaaagaaga ccagttagca
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA
1701 AAGATTTATT tttttcaata atttattgaa aagaggggtt ttggggtttt
1751 GGGGTTTTGG GG
FIGURE 18
FIGURE 19



! aactcatrta artactaan taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
6 1 tagatrtaat ttagaaagta tcaangaaa aatggaaatt gaaaacaact aagcacaau 
1 2 1 gccaaaagcc gaaaaactgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagti agggttaaga ctgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggia gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcacn cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaacrttaca tcagaactac cactaactac artgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aacttgcata ggttctctat atttttgatg caactgaan 
54 1 caaaaamg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
60 1 taagtgttia caaagatgcg tcagaagcaa gtntctgaa ttcaacgaat actaacttgg 
66 1 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa . 
72 1 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
84 1 agatgcaatc aaggctrtaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tccfaactct accttggaat caaagtacn 
96 1 gaccttcaag gatctcatta agnctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccnggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1 08 1 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1 141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg naaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaacrt gcaatcaaga tagcagrtaa caagaattta gatgaaatca-aaggacacac 
1 62 1 tgcaatcnc tctgatgm ctggnctat gagiacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gncgtacn gtctcgagtg tgcanagtc cttggtnga tggtaaaata' 
1 74 1 acgngtgaa aagtcctcat tctacatca cagttcacct agnctcaat gcaataagtg 
1801 ttacrtagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacnggtg gtggiactga tncccctat gagigcattg atgaatggac 
1 92 1 aaagaataaa actcacgtag acaatatcgt tarmgtct gatatgatga ngcagaagg 
1981 atancagat atcaatgna gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagiaaat cctaacarta aaatcmgc agttgactta gaaggnacg gaaagtgcct 
2101 taarctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcnaaag ncarttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222 1 ctttgcccrt caaaaaatag gacaaaagtg agtncttga gattcttcta iaacaaaaat 
228 1 ctcaccccac rrrtttgttt tattgcatag ccattatgaa atttaaatta ttatctatn 
234 1 atttaagtta cnacatagt ttatgtatcg cagtctatta gcctattcaa atganctgc 
2401 aaagaacaaa aaagattaaa a 



FIGURE 20 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLAVYIRNELYIRTTTN 
YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 
QDIRiCELTFRKCLQRCVRSKJSEFNEYQLGKYCTC 

KJCJU<£NLLTKLOAIK£SEDKSKR£TCDrMNVEDAIKALKPAVMKJOAKR 

APKIPNSTLESKYLTFKDLIKFCH1SEPICERVYKJLCKKYPKTEEEYKAAFGDSASAP 

FNPELAGKRMKIEISKTWENELSAKGNTAEVWDNLISSNQLPYTvIAMLRhiLSNILKAGV 

SDTTHSrVTNKJCEPKAVENSK^FPLQFFSAIEAVNEAVTKGFKAKXRENM^KGQIE 

AVK£VVEKTDEEK10)MELEQTEEGEFVKVNEGIGKQYn^SIELAJKIAVNKNLDEnCG 

HTAJFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFY1FSSPSSQ 

CNKCYLEVDLPGDELRJ'SMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHVDNIVILSD 

N^lAEGYSDn^RGSSIVNSIKXYKLDEVNPNIKJFAVDLEGYGKCLNLGDEFNENNYT 

KJFGMSDSILKPISAKQGGANMVEVIKNFALQKJGQK 



FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
6 1 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattrtg tattacaaaa 
121 tctagaagrt tacaaaagcc agattgagca etataagacc tagtagtaat agatcaaaga 
i 8 1 ggaggatctc aagcttttaa agttcaaaaa rtaagattag gatggaaact ctggcaacga 
24 1 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagrnaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
3 6 1 tttgaacgaa gatgaaaaca aaaagaatgg acrttctgaa tagcaagtga aagaagagta 
421 attaagaacg actactgaag aacaggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
54 1 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
60 1 ctaaaagaca tcatangn ggtggcrtaa agattamt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgartt 
721 ttcacaaaca atcaaaccta ctaataattc ttactagacl.gttaacatag acgttaattt 
78 1 tgataataat ctctgtacac tcgcattgct tagattttta ttatcactag aaagattcaa 
84 1 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtcn ttctcatcgc cacttacaag gcattcattt 
96 1 acaagncct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1 02 1 agatagctaa ttataggtat actcrttctc tacagactta aaattagttg acactaacaa 
1 08 1 agtccaagat tattttaagt tcrtataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaan tagmctat ccctacctaa ttcaatntg atttctacrt 
1261 tgttaattta taacactcga aactagagn tggartagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1 44 1 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1S01 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1S61 gcttgaagat ncagcgrta acngtaagc tacccaagaa atttatgata gcttgcacaa 
1 62 1 actrttgatt agaccaacaa amaaagaa grtcaaana agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatcc taagaacatt tatgaaacct taaacaatct 
1 74 1 taaaagatgc tctgnaata tatcaaaicc ccatggaaac arncrtatg aaccgacaaa 
1 80 1 taaagactct actrmata aatnaagci gaccnaaac taagaanat aacacgccaa 
1861 gtatacrm aagtagaacg aaimaaa laataacgtt aaaagtgcaa aaangaatc 
1 92 1 ctccfcaha gaaagcttag aagatanga tagtctngc aaatctattg cncttgtaa 
1981 aaacnacaa aatgttaata ctatcgccag tttgccctat cccaacaata cnagaaaaa 
204 1 tcctttcaat aagcccaatc rtctatmt caagcaatrt gaataattga aaaatngga 
2101 aaatgtatct atcaactgra ttcttgatca gcatatacct aanctam cagaattcrt 
2161 agaaaagaat aaaaaaataa aagcancat tttgaaaaga tattatttat tacaatatta 
222 1 tcttgartat actaaactat naaaacact tcaatagna cctgaattaa anaagtna 
228 1 cattaatiag caactagaag aangactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctnct atgaaccart atgtgagtn atcaaagaat catcctaaac 
240 1 cctttagcta acagatmg accaaaacac tgtaagtgat gactctatta aaaagacttt 
246 1 agaatctata tctgagtcta agcatcatca namgaga ttgaacccta gnaatctag 
252 1 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagcn gcgacgaaaa 
2581 aggtgtma gtaaaagcat actataaan ccctctatgt ttaccaactg gcacrtana 
264 1 cganacaat tcagatagat ggtganaar taaatarrag rnaaataaa tanaaacat 
2701 tgaatarnc tngcnan amgaataa tacatacaat agtcattm agcgtmga 
276 1 atararma gnamaat tcanamr aagtaaataa natrmca atcattnn 
2821 aaaaaatcg 
MSRJWQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK

EEDLKJ.LKFIOJQDQDCNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKJCNGLSEQQVICEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETDYDTEKV^EISHDQK>JYVSIYANQKTSYCWWLICDYFNKNNYDHLhA'S[NRLETEAE 

FYAFDDFSQTIKJ-TWSYQTTVNIDWFDWLCILALLRFLLSLERPhnLNmSSYTTlN 

QYNFEKIGELLET1FAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKI.VDTNKVQDYFKJLQEFPRI/ITTVSQQAIPVSATN 

VSIPTQFNFDFYFVNLQHLKI.EFGLEPNILTKQKLENLLLSIKQSK>rLKFLRJLNFYTY 

VAQETSRKQILKQATTIKNLKNNKJ^QEETPETKJDETPSESTSGMKFFDHLSELTELED 

FSVNLQATQEIYDSLHKLLIRSTNLKJG^KLSYKYEMEKSKMDTFIDLKNIYETLNNLK 

RCSVNISNPHGNISYELTNKDSTFYKFKlTLNQELQHAKYTFKQh^QFNhA^KSAiaE 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 

NLEhTVSrNCILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVY1>JQQLEELTVSEVHKQV\VENHKQKAFYEPLCEFIICESSQTLQLIDFDQNTVSD 

DSIKJC1LESISESKYHHYLRJLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKPP 
LCLPTGTYYDYNSDRW 



FIGURE 23 

MKJLFEFIQDKLDIDLQTNSTYKJENLKCGHFNGLDEILTTCFAL 

PNSRKJALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKJARNEDVNNSLFCHS 

ANVhA/TLLKGAAWKMFHSLVGTYAFVDLLINY'TVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKJLTDLR 

EAIFPTNLVKJPQRl.KVRJNLTLQKiLKJUiKJU.NYVSILNSICPPLEGTVLDLSHLSR 

QSPK^RVLKJIIVILQKXLPQEMFGSKKNKGKJIKNLNLLLSLPLNGYLPFDSLLKKL 

RLKI)FRWT.FISDIWFTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

NATRJiDT\VNKLITPFIVEYFKTYLVEhWVCR^HNSYTLSNFNHSKMRJIPKJ<^ 

IIAIPCRGADEEEFT1YK£NHKNAIQPTQKJLEYLRNKJIPTSFTXJYSPTQIADRJK£ 
FKQRLLKKJTWVLPELYFMKFDVKSCYDSIPR^^ 

TNTGVLKLFhAA^ASRVPKPYELYID^RTVHLSNQDVINVVEMEIFKTALWVEDKCY" 
IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILICLADDFLIISTDQQQV 
rNIKXLAMGGFQKYNAK^NRI3KJLAVSSQSDDDTVIQFCAMmFVK£LEVWKiiSSTMN 
NFH1RSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSfNVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIEYEVRFTILNGFLESLSSNTS 
KFKDNIILLRKEIQHLQAYIYIYIHFVN 



FIGURE 24 

Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT
FIGURE 25



Motif 0 

human AKFLHWLMSVYVVELLRSFFYVTETTFQKNR
tezl X8EIEWl.VLCKR6NAXMCLSDrEKRXQIFABrtWX.YKSPXIP2LQSPrYIXBSSDl.RllR 
EST2 LKDFRWLFISD- — IWrTKHKFEHLMQLAlCFISWLPRQLIPKIIQTFFYCTEISSTVT- 
0123 TREISWMQVET-SAXHFYYFDHEN-IYVLWKLLRWIFEDLWSLIRCFrYVTEQQKSYSX 

* * •#••** 

• • * •••• 

Motif 1

human LFFYRKSVWSKLQSIGIRQHLKRVQLRDVSEAEVRQHREARPALLTSRL

t*zl TVYFRXDIWKIXCRPFI-TSMIWEAFEXINEHNVRMM^ 

EST 2 I VYFRHDTWHKLXTPFI VEYFXTYLVEKWCRNHHS YTLS — HFNKSKMRI I PXXSNNEF 

Si 2 3 TYY YRKHIWDVIMKHS I -ADLXXETLAEVQEXEVBEWXXS -LOFAPGXLRLI PKK--TTF 

Zl + * . * . 

yj ... 

Motif 2

human RPIVNMDYVVGARTFRREKRAERLTSRVKALF-SVLNYERA

"tezl RLTTN-LRKRFLIKMOSKKKMLVSTNQTLRPVASILXHLINEESSGIPFHLEVYHKLLTF 

r EST2 RIIAIPCRGADEEEFTIYKENHKMAIQPTQKILEYLRMKRPTSFTXIYSPTQIADRIKEF 

^123 RPIMTFHXXIVNS0RXTTKLTTHTKIXH6HLMIJCTLXN-RMFKDPFGFAVFHYDDVMXXY 

UJ * • * * ... 

*M Motif 3 (A) 

tori XKDLLKHRMFOR-KKYFVRI DIK5CYDRIXQDL-KFRI VKK-KI-XDPEPVIRKYATIHAT5 

EST 2 KQRLLKKFKMVLiPELYFMXFDVKSCYOS I PRMECMRILXD-ALKNENGFFVRSQYFFHTH 

p!23 KBFVCKWKQVGQPXLFFATMDIEXCYDSVNREXLSTFLKTTXLLSSDFWrMTAQILKRKH 



FIGURE 26 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGAGAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 




FIGURE 27 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 
SS^^^]^ ^^^^^^^^^^^gtgtgtacgtcgtcgagctgctcaggtc

^CTTTTATGTCACGGAGACCACGra 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGC^^ 

AG ? CAGGCCCGCCCTGCT GACGTCCAGACTCCGCTTCATCCCCAA^ 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGA^^ 
GAGA S^ GAGGGCCGAGGG Tra 

GTGCTCAACTACGAGCGGGCGCG 






FIGURE 29 



MTEHHTPKSRJLR5LENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELJANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRX 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKJa^KQDLYFNLHSICDRhTTVHMWLOWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRSlLVRVFPKLIWGNQRJFEIILKDLETFLiCLSRYESFSLHYLMSNIKlSEIEWLVL 

G1GISNAKMCLSDFEKJIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFFT 

SMKMEAFEKINEN>A/RJi4DTQKTTT.PPAVIRI.LPKKNTFRJ.ITNLRKRFLIKMGSNKXMLVSTNQT 

LRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRJCKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVrRXYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNIOaDAKJCFLNLSLRGFEKHNFSTSLEKTVINFENSNGirNNTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRJ.GYSMCMRAQAYLKJ^KDIFIPQRMFITDLLNVIGRKIWKK1.AEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDL1KPLRPVLRQVLFLHRRJAD 



FIGURE 30 



ggtaccgatnacmccmcncataagctaattgcncctcgaacgctcctaaatctctggaaatatnnacaagaactcaataacaataccaagtcaaanccaatatgaagg 
tgnanagtgatcgataaiamctatniaicggicgnaccaagtataaggacaa^ 

ggncgcnacrrnaatcgtggiactgtmagctgctacncugccaaccgcgtgcnctaccccgtcanggatatagctcrtggagtagcicacagaaatccnacaaatcn 
ctgatgagactatanagaacarucagtccgtgcauncnaacatggagccnacactnagatgagtcacgtcgcatgatggagtatnggiatcatccaacgtngccng 
aaaagg^gataanamgcaaaatcatgtccrtagtggtggtaatccgcgaaagjttmgatgcn^ 
ciccmaacgcggjnutrmctatmctanctcatgrtgttccaaatatgtatca^^^^ 

aauatctaaanagmcgcnataangatagtagtagaaaganggtgartctactcgtgmtg^anagmaaagatacmgcaaaacatnanagctatcattauuaaa 
aaaatcctataanataaaurmtcaatacngcggtcaciatTtttttaaaacgnatgatcag 

gaccgaacaccatacccccaaaagcaggattcttcgctttctagagaatcaatatgtatacctatgta 
ccttaaatgattatgtacaacttGttttgagagggtcgccggcaagctcgtatagcaatatatgcgaa 
cgcttgagaagcgatgtacaaacgtcc1 1 i 1cta1 i 1 1 1c 1 1 cattcgactgtagtcggcttcgacagt 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAmAA 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGG l i i i CCATGgtaaggtanctaattgtgaaatamacctgcaanactgntcaaagagangtatttaaccgaiaaagAA 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAAC'I I I I GTTAGAAATgtaaaiaccggnaagaigttgcgcacmgaaca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCGAAAGGAAGTATTTTTGAGGCrCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCAC 1 I 1 I I AAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCA'l'ri l'l l" ACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aaciaatactgnatccncaiaacmrmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGA 1 1 I I 1 CCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATA^ 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tangtataaaamanaccactaacgatrrtaccagACCTCGAAACTTTCTT^ 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttt^ 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 
AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATC 11111111 ATATCACTGAATC 
AAGTCATTTACGAAATCGAACTGTTTATTTTAGAA^^ 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtaRnaaagtattttngcaaaaagctaatatmcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgunaartmggtcatcaatgtacmacnctaatctam 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattataiaatgcgcgancctcatiartaatm 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCG 1111 CCTATTgtaagntatttntcanggaanttnaacaaanctttmagTTG ATAT 

GGTGCCl 1 1 IGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATTTTGT 

GGATTATTGGACCAAAAGTTCTTCTGAAA 1 1 I 1 1 AAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaangagaactgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

ACCGCTCAATTCTGTCAT U 1111 1 GTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 

TACGAAAAAGAAAGGATCAGTG7TGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGg 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 
ATAC' l 1 11 1 1 1 AATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 
ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 
ATATGGGGAAATC I I 1 I 1 I'll ACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 
ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 
GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
AAAGAATGTTCATAACGGgtgagtacnannaactagaaaagtcanaattaaccttagATC mm GAATGTTATTGGAAGAAAA 
ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
FIGURE 31



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTEISST VTXVYFRHDT V1X KLIT V FIVE YFK-TYLVEN 

FFYVTEQQKS YS KTYYYRKN IWDVI-MKMS IAD LKK ETLA--EVQE 

KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQFSEK NIKQMKMNFYQ 

FFY.TE..K. .^..YYYKK. IW...-KL F..K V. . 
Euplotes pep

Trans of tetrahymen 

Consensus 
Consensus
K.



GFAPGKfJ: 

sfqkypqg?£ 
FREfctfTFNKK
£BEI yTTFLRK 

tesUiMTF.RK 



ADEEEFTIYK ENHKNAIQP5J5 
S-1: FFY VTE TTF QKN RLF FYR KSV WSK

S-2: RQH LKR VQL RDV SEA EVR QHR EA

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE

A-1: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV
[Map of genomic clone A5 (vector, genomic DNA insert, vector): 5625 bp sequenced, including a ~2 kb Hind III fragment; scale bar 1 kb. Map of the tez1 gene: RT motifs 1, 2, 3(A), 4(B'), 5(C) and 6(D), introns, Hind III site, and the RT-PCR products obtained with M2-B14, M2-B15 and M2-B16; scale bar 500 bp.]
Poly 4

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg 
QTKGIPQG 



4(B f ) 



5(C) 

D D Y L L I T 
ctg ctg atg gag gag tag tgg -5 
a a aaaaa a a* 
t t t t 
c c 
Poly 1
FIGURE 36



PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 

LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Ea_p123 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRVV

Sc_p103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS



Q K V G I P Q G

caa aaa gtt ggt atc cct cag gg <-- Actual Genomic Sequence.

Poly 4

'■4 t t c 

=R a a g c c teg 

||pag acc aaa gga acc cca caa gg > 

r p ag acc aaa gga acc cca cca ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT " ATG 

s tc Cgg CCC ccc Caa ggt age ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 



M 



;JAA GAT TTG ATT GAT GAA TAC CTA TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA 
CTT CTA AAC TAA CTA CTT ATG GAT AGC AAA TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 



I 



K K K 



GTA GTC gac gac cac ccc ccc acc acc 
CAT CAG ctg ctg a eg gag gag tag tgg 

Vv DDYLLrT 

< ctg ctg aeg gag gag cag egg 

a a aaaaa a a 
t e c c 
c c 



gac gat ttc ctc ttt ata aca
D D F L F I T

<-- Actual Genomic Sequence.
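The codon-to-residue pairing drawn in this figure can be checked mechanically. The following is a minimal sketch, not part of the original figure, that translates a DNA stretch with the standard genetic code. The function name `translate` and the table construction are illustrative assumptions. The standard code is assumed to be appropriate for the S. pombe sequence shown here; several ciliates use variant codes, so this table should not be applied to the Euplotes sequences without adjustment.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 using the standard code.

    Whitespace is ignored, case is ignored, a trailing incomplete codon is
    dropped, and any codon with an unrecognised base is reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


if __name__ == "__main__":
    # Genomic stretch taken from the S. pombe sequence listing; it encodes
    # the motif written under the genomic line in the figure.
    print(translate("GAC GAT TTC CTC TTT ATA ACA"))  # DDFLFIT
```

Used this way, the sketch only confirms that a candidate reading of the nucleotide line is consistent with the peptide printed beneath it; it does not model the degenerate positions of the Poly 1 and Poly 4 primers.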
3' RT PCR Strategy

[Diagram: mRNA (5' to 3') with poly(A) tail, showing the primer positions and products at each step.]

1. Synthesis of cDNA with QT Primer.
2. First Round PCR Using Outside Primer and QO Primer.
3. Second Round PCR Using Inside Primer and QI Primer.
4. Sequence Second Round PCR Products Using Inside Primer or QI Primer.
-Size Selected Libraries from P. Nurse
3 - 4 kb
5 - 6 kb
7 - 8 kb
11 - 12 kb

-Libraries from J.A. Wise
Sau3A Partial Digest
Hind III Partial Digest

cDNA Libraries

GAD (Gal Activation Domain) Library
REP Library from R. Allshire
REP81ES Library (old)
REP81ES Library (new)
REP41ES Library
D.

[Diagram labels: Original PCR; 3' RT-PCR; scale bar 500 bp; Mg Conc.]
A5 B2

Hind III Digested Positive Genomic Clones
5' RT PCR Strategy

[Diagram: mRNA with poly(A) tail, showing the cDNA and ligated oligo at each step.]

1. Synthesis of cDNA with Specific Downstream Primer.
2. Ligate Oligo with 5'-P and blocked 3' to cDNA using T4 RNA Ligase.
3. First Round PCR
4. Second Round PCR
Alignment of RT Domains from Telomerase Catalytic Subunits.

Motif 0 

S.p. Tezlp (429). WLYNSFIIPILQSFFYITES5DLRNRTv r YFRKDIW ...(35) 
S.c. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-I\. r YFRHI3TW ...(35)"'" 
E.a. pl23 (441) . WIFEDLWSL I RCFFYVTEQQKSYSKTYYYRKN IW . . (35) 

* #*>**★ *** 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.p. Tezlp AVIRLLPKK — NTFRLITN-LRKRF ...(6i)... * ■ 

S.c. Est2p SKMRI I PKKSNNEFRI I AIPCRGAD . . . (62) 
E.a. pl23 GKLR1 1 PKK- -TTFRP IMTFNKKIV . . . (61) . . . 

* * * * * * * 

Motif 3 (A) AF 
h hDh GY la 
S.p. Tezlp XKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 
S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK . . . (75) 

E.a. pl23 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 

* * *★* * 

Motif 4 (B ' ) 

hPQG pP hh h 
S.p. Tezlp YLQKVGIPQGSILS5FLCHFYMEDLIDEYLSF ...(6) 
S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8) 
E.a. pl23 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14).!. 
***** * * 

Y Motif 5(C) Motif 6(D) 

h F DDhhh Gh h cK h 

S.p. Tezlp VLLRWDDFLFITVNIOCDAKKF^ (205) 
S.c. Est2p L I LKLADDFL 1 I STDQQQVIN IKKLAMGG FQ KYHAKANRDK I LAVS SQS "(173) 
E.a. pl23 L1MRLTDDYLLITTQENNAVLFIEKLINVSRENGF KFN"MKKLQTSFPLS (209 ) 



A

[Full-length amino acid sequence alignment of Ea_p123, Sp_Tez1p and Sc_Est2p.]
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1. Transform with linear fragment
containing the telomerase gene
disrupted with a LEU2 or ura4 marker.



M 



telomerase gene::M 
wild type telomerase gene 




wild type telomerase gene 



2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 





M 









telomerase gene::M 



Wild Type (X2) 



Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 








tez1+/tez1::ura4
tez1::ura4+
tez1+
tez1+
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tez1+

tez1::ura4+
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Tez1 disruption causes progressive
shortening of telomeres in S. pombe



Tetrad 



heterozygote diploid 
wt diploid 
wt haploid 



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Generations
after sporulation



tez1+ tez1+ tez1Δ



Telomeric 

Apa I 
fragment 




600 bp 
500 bp 

400 bp 
300 bp 

200 bp 
100 bp 
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1 ggtaccgacttactttcccttctccataagctaattgctccctcgaacgctcctaaacctctggaaacatttttacaaga 80 
81 actcaataacaataccaagtcaaaccccaacatgaaggtgttattagtgatcgacaatatttccactttatcggtcgcta 160 
161 ccaagtataaggacaaaaagaacaacttccctccccctaaagacttttactttattaacttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgcggcactgttttagctgctacttctagccaaccgcgcgtttctaccccgtcattggatat 320 
321 agctcttggagcagctcacagaaaccctcacaaaccccctgatgagactataccagacccattacagtccgcgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggcatcacccaacgtttgccttgaaaag 480 
4 81 gttgataattatttgcaaaaccacgtccttagcggcggtaatccgcgaaagctttctgacgcctgcacacgtctagcatg 560 
561 attgagatattcaaaaattcctacccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaacatgtatcaccncgcaticaggctttitttccgtitccactcctggaaccgcacctttttcaccactccccctaatga 720 
721 acaacccaaattagctccgctcataactgatagtagtagaaagattggtgactccacccgtgtaacgccactagcttaaa 800
801 gataccttgcaaaacattcattagccatcattatataaaaaaaatcctataactacaaatactaatcaacatttgcggtc 880
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018
1 MTEHHTPKSRILRFLENQYV 20

1019 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078
21 YLCTLNDYVQLVLRGSPASS 40

1079 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138
41 YSNICERLRSDVQTSFSIFL 60

1139 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198
61 HSTVVGFDSKPDEGVQFSSP 80
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l 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20
arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG

30
leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT

40 50
gly ile arg gln his leu lys arg val gln leu arg glu leu ser
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG

60
glu ala glu val arg gln his arg glu ala arg pro ala leu leu
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG

70 80
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG

90
ile val asn met asp tyr val val gly ala arg thr phe arg arg
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA

100 110
glu lys arg ala glu arg leu thr ser arg val lys ala leu phe
GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC

120
ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC

130 140
ala ser val leu gly leu asp asp ile his arg ala trp arg thr
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC

150
phe val leu arg val arg ala gln asp pro pro pro glu leu tyr
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC

160 170
phe val lys val asp val thr gly ala tyr asp thr ile pro gln
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG

180
asp arg leu thr glu val ile ala ser ile ile lys pro gln asn
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC
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Motif-1 
Ep p123     ...LVVSLIRCFFYVTEQQKSYSKT...
Sp Tez1     ...FIIPILQSFFYITESSDLRNRT...
Sc Est2     ...LIPKIIQTFFYCTEISSTVTIV...
Hs TCP1     ...YVVELLRSFFYVTETTFQKNRL...
consensus              FFY TE



K 

Motif 0 phhh K hRh R 

Ep p123     ...KSLGFAPGKLRLIPKKT--TFRPIMTFNKKIV...
Sp Tez1     ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL...
Sc Est2     ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD...
Hs TCP1     ...ARPALLTSRLRFIPKPD--GLRPIVNMDYVVG...
consensus                R  PK      R I

AF 

Motif A h hDh GY h 

Ep p123     ...PKLFFATMDIEKCYDSVNREKLSTFLK...
Sp Tez1     ...RKKYFVRIDIKSCYDRIKQDLMFRIVK...
Sc Est2     ...PELYFMKFDVKSCYDSIPRMECMRILK...
Hs TCP1     ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//...
consensus          F   D    YD

Motif B hPQG pS hh 

Ep p123     ...NGKFYKQTKGIPQGLCVSSILSSFYYA...
Sp Tez1     ...GNSQYLQKVGIPQGSILSSFLCHFYME...
Sc Est2     ...EDKCYIREDGLFQGSSLSAPIVDLVYD...
Hs TCP1     ...RATSYVQCQGIPQGSILSTLLCSLCYG...
consensus               G  QG   S
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Sp Tez1     ...KKGSVLLRVVDDFLFITVNKKD...
Sc Est2     ...SQDTLILKLADDFLIISTDQQQ...
Hs TCP1     ...RRDGLLLRLVDDFLLVTPHLTH...
consensus                DD L

Motif D Ghh cK 

Ep p123     ...NVSRENGFKFNMKKL...
Sp Tez1     ...LNLSLRGFEKHNFST...
Sc Est2     ...KKLAMGGFQKYNAKA...
Hs TCP1     ...LRTLVRGVPEYGCVV...
consensus            G
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FIGURE 50 



1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC
51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC
101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG
151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC
251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC
301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT
351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC
401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG
451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC
651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC
751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC
801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT
851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC
901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG
951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT
1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC
1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG
1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG
1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA
1201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG
1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT
1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG
1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC
1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG
1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG
1601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC
1651 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT
1701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC
1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT
1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG
1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT
1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG
1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG
2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA
2051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA
2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA
2251 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC
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FIGURE 51 (cont.) 
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BSPH1 6951
APAL1 6733



BSPH1 5943



APAL1 
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FIGURE 53 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC

20 30
his tyr arg glu val leu pro leu ala thr phe val arg arg leu
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG

40
gly pro gln gly trp arg leu val gln arg gly asp pro ala ala
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT

50 60
phe arg ala leu val ala gln cys leu val cys val pro trp asp
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC

70
ala arg pro pro pro ala ala pro ser phe arg gln val ser cys
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC

80 90
leu lys glu leu val ala arg val leu gln arg leu cys glu arg
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC

100
gly ala lys asn val leu ala phe gly phe ala leu leu asp gly
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG

110 120
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC

130
tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG
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FIGURE 58 



met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10
pro arg ala pro arg cys arg ala val arg ser leu leu arg ser
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC

20 30
his tyr arg glu val leu pro leu ala thr phe val arg arg leu
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG

40
gly pro gln gly trp arg leu val gln arg gly asp pro ala ala
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT

50 60
phe arg ala leu val ala gln cys leu val cys val pro trp asp
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC

70
ala arg pro pro pro ala ala pro ser phe arg gln val ser cys
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC

80 90
leu lys glu leu val ala arg val leu gln arg leu cys glu arg
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC

100
gly ala lys asn val leu ala phe gly phe ala leu leu asp gly
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG

110 120
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC

130
tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG
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FIGURE 59 

(page 1 of 7 for Fig. 59: SEQ ID NO: 122/123) 

1
met
GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG

10
pro arg ala pro arg cys arg ala val arg ser leu leu arg ser
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC

20 30
his tyr arg glu val leu pro leu ala thr phe val arg arg leu
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG

40
gly pro gln gly trp arg leu val gln arg gly asp pro ala ala
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT

50 60
phe arg ala leu val ala gln cys leu val cys val pro trp asp
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC

70
ala arg pro pro pro ala ala pro ser phe arg gln val ser cys
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC

80 90
leu lys glu leu val ala arg val leu gln arg leu cys glu arg
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC

100
gly ala lys asn val leu ala phe gly phe ala leu leu asp gly
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG

110 120
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC

130
tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG



FIGURE 60 
MOUSE TRT SEQUENCE 



(SEQ ID NO: 124; page 1 of 2) 



1 CGGGTGGGAGGCCCATCCCGGCCTTGAGCACAATGACCCGCGCTCCTCGT
51 TGCCCCGCGGTGCGCTCTCTGCTGCGCAGCCGATACCGGGAGGTGTGGCC
101 GCTGGCAACCTTTGTGCGGCGCCTGGGGCCCGAGGGCAGGCGGCTTGTGC
151 AACCCGGGGACCCGAAGATCTACCGCACTTTGGTTGCCCAATGCCTAGTG
201 TGCATGCACTGGGGCTCACAGCCTCCACCTGCCGACCTTTCCTTCCACCA
251 GGTGTCATCCCTGAAAGAGCTGGTGGCCAGGGTTGTGCAGAGACTCTGCG
301 AGCGCAACGAGAGAAACGTGCTGGCTTTTGGCTTTGAGCTGCTTAACGAG
351 GCCAGAGGCGGGCCTCCCATGGCCTTCACTAGTAGCGTGCGTAGCTACTT
401 GCCCAACACTGTTATTGAGACCCTGCGTGTCAGTGGTGCATGGATGCTAC
451 TGTTGAGCCGAGTGGGCGACGACCTGCTGGTCTACCTGCTGGCACACTGT
501 GCTCTTTATCTTCTGGTGCCCCCCAGCTGTGCCTACCAGGTGTGTGGGTC
551 TCCCCTGTACCAAATTTGTGCCACCACGGATATCTGGCCCTCTGTGTCCG
601 CTAGTTACAGGCCCACCCGACCCGTGGGCAGGAATTTCACTAACCTTAGG
651 TTCTTACAACAGATCAAGAGCAGTAGTCGCCAGGAAGCACCGAAACCCCT
701 GGCCTTGCCATCTCGAGGTACAAAGAGGCATCTGAGTCTCACCAGTACAA
751 GTGTGCCTTCAGCTAAGAAGGCCAGATGCTATCCTGTCCCGAGAGTGGAG
801 GAGGGACCCCACAGGCAGGTGCTACCAACCCCATCAGGCAAATCATGGGT
851 GCCAAGTCCTGCTCGGTCCCCCGAGGTGCCTACTGCAGAGAAAGATTTGT
901 CTTCTAAAGGAAAGGTGTCTGACCTGAGTCTCTCTGGGTCGGTGTGCTGT
951 AAACACAAGCCCAGCTCCACATCTCTGCTGTCACCACCCCGCCAAAATGC
1001 CTTTCAGCTCAGGCCATTTATTGAGACCAGACATTTCCTTTACTCCAGGG
1051 GAGATGGCCAAGAGCGTCTAAACCCCTCATTCCTACTCAGCAACCTCCAG
1101 CCTAACTTGACTGGGGCCAGGAGACTGGTGGAGATCATCTTTCTGGGCTC
1151 AAGGCCTAGGACATCAGGACCACTCTGCAGGACACACCGTCTATCGCGTC
1201 GATACTGGCAGATGCGGCCCCTGTTCCAACAGCTGCTGGTGAACCATGCA
1251 GAGTGCCAATATGTCAGACTCCTCAGGTCACATTGCAGGTTTCGAACAGC
1301 AAACCAACAGGTGACAGATGCCTTGAACACCAGCCCACCGCACCTCATGG
1351 ATTTGCTCCGCCTGCACAGCAGTCCCTGGCAGGTATATGGTTTTCTTCGG
1401 GCCTGTCTCTGCAAGGTGGTGTCTGCTAGTCTCTGGGGTACCAGGCACAA
1451 TGAGCGCCGCTTCTTTAAGAACTTAAAGAAGTTCATCTCGTTGGGGAAAT
1501 ACGGCAAGCTATCACTGCAGGAACTGATGTGGAAGATGAAAGTAGAGGAT
1551 TGCCACTGGCTCCGCAGCAGCCCGGGGAAGGACCGTGTCCCCGCTGCAGA
1601 GCACCGTCTGAGGGAGAGGATCCTGGCTACGTTCCTGTTCTGGCTGATGG
1651 ACACATACGTGGTACAGCTGCTTAGGTCATTCTTTTACATCACAGAGAGC
1701 ACATTCCAGAAGAACAGGCTCTTCTTCTACCGTAAGAGTGTGTGGAGCAA
1751 GCTGCAGAGCATTGGAGTCAGGCAACACCTTGAGAGAGTGCGGCTACGGG
1801 AGCTGTCACAAGAGGAGGTCAGGCATCACCAGGACACCTGGCTAGCCATG
1851 CCCATCTGCAGACTGCGCTTCATCCCCAAGCCCAACGGCCTGCGGCCCAT
1901 TGTGAACATGAGTTATAGCATGGGTACCAGAGCTTTGGGCAGAAGGAAGC




FIGURE 53 (cont.) 



140 150
trp gly leu leu leu arg arg val gly asp asp val leu val his
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC

160
leu leu ala arg cys ala leu phe val leu val ala pro ser cys
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC

170 180
ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC

190
thr gln ala arg pro pro pro his ala ser gly pro arg arg arg
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT

200 210
leu gly cys glu arg ala trp asn his ser val arg glu ala gly
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG

220
val pro leu gly leu pro ala pro gly ala arg arg arg gly gly
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC

230 240
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC

250
ala ala pro glu pro glu arg thr pro val gly gln gly ser trp
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG

260 270
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT

280
val val ser pro ala arg pro ala glu glu ala thr ser leu glu
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG
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290 300
gly ala leu ser gly thr arg his ser his pro ser val gly arg
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC

310
gln his his ala gly pro pro ser thr ser arg pro pro arg pro
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC

320 330
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC

340
leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA

350 360
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG

370
glu thr ile phe leu gly ser arg pro trp met pro gly thr pro
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC

380 390
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC

400
leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG

410 420
val leu leu lys thr his cys pro leu arg ala ala val thr pro
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA

430
ala ala gly val cys ala arg glu lys pro gln gly ser val ala
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG
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440 450
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG

460
leu arg gln his ser ser pro trp gln val tyr gly phe val arg
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG

470 480
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG

490
his asn glu arg arg phe leu arg asn thr lys lys phe ile ser
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC

500 510
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG

520
met ser val arg asp cys ala trp leu arg arg ser pro gly val
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT

530 540
gly cys val pro ala ala glu his arg leu arg glu glu ile leu
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG

550
ala lys phe leu his trp leu met ser val tyr val val glu leu
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG

560 570
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC

580
arg leu phe phe tyr arg pro ser val trp ser lys leu gln ser
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC

590 600
ile gly ile arg gln his leu lys arg val gln leu arg glu leu
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG
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610
ser glu ala glu val arg gln his arg glu ala arg pro ala leu
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG

620 630
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG

640
pro ile val asn met asp tyr val val gly ala arg thr phe arg
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC

650 660
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG

670
phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG

680 690
gly ala ser val leu gly leu asp asp ile his arg ala trp arg
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC

700
thr phe val leu arg val arg ala gln asp pro pro pro glu leu
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG

710 720
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC

730
gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG

740 750
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC

760
his gly his val arg lys ala phe lys ser his val ser thr leu
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG
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770 780
thr asp leu gln pro tyr met arg gln phe val ala his leu gln
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG

790
glu thr ser pro leu arg asp ala val val ile glu gln ser ser
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC

800 810
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC

820
phe met cys his his ala val arg ile arg gly lys ser tyr val
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC

830 840
gln cys gln gly ile pro gln gly ser ile leu ser thr leu leu
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC

850
cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG

860 870
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG

880
leu val thr pro his leu thr his ala lys thr phe leu arg thr
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC

890 900
leu val arg gly val pro glu tyr gly cys val val asn leu arg
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG

910
lys thr val val asn phe pro val glu asp glu ala leu gly gly
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC

920 930
thr ala phe val gln met pro ala his gly leu phe pro trp cys
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC
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940
gly leu leu leu asp thr arg thr leu glu val gln ser asp tyr
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC

950 960
ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC

970
arg gly phe lys ala gly arg asn met arg arg lys leu phe gly
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG

980 990
val leu arg leu lys cys his ser leu phe leu asp leu gln val
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG

1000
asn ser leu gln thr val cys thr asn ile tyr lys ile leu leu
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG

1010 1020
leu gln ala tyr arg phe his ala cys val leu gln leu pro phe
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT

1030
his gln gln val trp lys asn pro thr phe phe leu arg val ile
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC

1040 1050
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC

1060
ala gly met ser leu gly ala lys gly ala ala gly pro leu pro
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC

1070 1080
ser glu ala val gln trp leu cys his gln ala phe leu leu lys
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG

1090
leu thr arg his arg val thr tyr val pro leu leu gly ser leu
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC
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1100 1110
arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG

1120
leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC

1130 1132
phe lys thr ile leu asp OP
TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA
CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC
AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC
GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC
CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG
CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT
CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA
GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA
CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAA
AAAAAAAAAA
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FIGURE 51 (cont.) 
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FIGURE 50 (cont.) 



2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC
2351 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG
2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG
2601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT
2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC
2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC
2801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC
2901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG
2951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA
3001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC
3051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG
3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC
3251 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG
3351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC
3401 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT
3451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA
3651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG
3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA
3751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA
3851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA
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FIGURE 47 (cont.) 



AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA
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FIGURE 47 (cont.) 
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420 
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430 440

1 CGG r?r iSS i SU leU his pro ST lu ser 3 ln Si" arg arg asp 
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450

val ala gly gly gln gly arg arg arg pro ser ala leu arg gly
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC

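460 470
arg ala val ala val pro pro ser ile pro ala gln ala asp ser
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG

480
thr pro cys his leu arg ala thr pro gly val thr gln asp ser
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC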

490 500
pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC

510
pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC

520 530
his pro gly leu met ala thr arg pro gln pro gly arg glu gln
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG

540
thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG

550 560
arg gly gly pro his pro gly leu his arg trp glu ser glu ala
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC

564
OP
TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC
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FIGURE 47 (cont.) 



190 200
thr tyr cys val arg arg tyr ala val val gln lys ala ala met
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG

210
gly thr ser ala arg pro ser arg ala thr ser tyr val gln cys
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC

220 230
gln gly ile pro gln gly ser ile leu ser thr leu leu cys ser
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC

240
leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG

250 260
arg asp gly leu leu leu arg leu val asp asp phe leu leu val
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG

270
thr pro his leu thr his ala lys thr phe leu arg thr leu val
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC

280 290
arg gly val pro glu tyr gly cys val val asn leu arg lys thr
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA

300
val val asn phe pro val glu asp glu ala leu gly gly thr ala
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT

310 320
phe val gln met pro ala his gly leu phe pro trp cys gly leu
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG

330
leu leu asp thr arg thr leu glu val gln ser asp tyr ser ser
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC

340 350
tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC

360
phe lys ala gly arg asn met arg arg lys leu phe gly val leu
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG

370 380
arg leu lys cys his ser leu phe leu asp leu gln val asn ser
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC

2/4 



FIGURE 46 (cont.) 



4666 gccgacccccaaagcaagcacaccacaggactcctagcaaagcaaaactaaccccgctactagccctgatcgacccgtct 4745
4746 ttatccctatactcttaagaaagaccgacagcggccgctgaccactgcccacacgcccaccaaacgggagcggccaaaca 4825
4826 ttaaaagcaacacatgaggccaacccccccccattcagaacaaggaaagtggctccccataacgaacaatgcccgcacta 4905
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lilt ttt CtaCCCatggCgSCC9 5=c"g="c t gagacgaaaagaaac t aaggacagcctgaacactaatagc CC ac ag a Ills 
Hit " 9 "r ataCaag9 " Ct9C "" CCCCgaCCtCaaC " tgCaCgggCgaaaag " 5225 

MM ^ccoa"^!^""" 9 ^^ 5305 
5386 ^f" 93 """ 993953 """""*^ 5385 

54« aaaaaa9aaaaCatCattgggagacaCctc "^^ aatca 5a t gcggaga gt atc C cc a gcgga C ccctgacgccaaca 5465 

5466 acttctacctccgaaacgtacggccccaccgccgcctcgacccctcgcagccccacgcagccaagcgaccaaaggcacc 5544 
FIGURE 46 (cont.)
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3344 actccttCCtag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 
644 DMVPFEKVVQLLSMKT 659

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 
660 SDTLFVDFVDYWTKSSSEIF 679 



3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaatcgttgaattgcaataaca 3532
680 KMLKEHLSGHIVK 692
FIGURE 30 (cont.)



ATGgtacgtgtcggtctcgagacttcagcaaiartgacacatcagGCI 1111 I GTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 

TCAAATATC ATCC ATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 
TAAGACCAGTTTTGCGACAGGTGTTATTTTTACAT^ 

lanaciggtgtcrtaaacaaunanaciaagtattgctgacccccaaagcaag 

atccnaiacrmaagaaagangacagiggngctgactactgcccacatgcccatiaaacgggagiggrtaaacactaaaagtaatacatgaggcta 
aataaggaaagtggrmctataatgaataacgcccgcactaatgcaaaaagac^^ 

acccagigngngaagaaagcaaggataatnggaacaagcrtctgcagatgacaggctaaatmggtgaccgaattnggtaaaagccccaggnatccatggtggccg 
gccngciactgagacgaaaagaaactaaggatagmgaatactaaugctcattiaatgtcnatataaggtmgtrtmcctgacttcaa 

gtgnaagccanangganccgaaaiagccaaantcnggncctcaaagcggaagtctaaagaacttangaagcnatgaggcncaaaaactcctcctgatnaaaggag 

gaatcttccaccgatgaggaaatggatagcnatc^gctgctgaggagaagcctaatttmgcaaaaaagaaaatatcangggagacaictcngatgaat^ 

gagtatctccagcggatccngatgtcaataacnctamctgaaatgtatggtcctactgtcgcttcgacnctcgtagctctacgcagttaagigaccaaaggiacc 
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FIGURE 12 (cont.) 



TTAAACATTTC AAAAATTCC ACCTAAC AC AC AT AC ATTC ATTAAAATTCA T A T ATTATAC 

IMO 

A^TTTrTAAACTTTTTAACCTCCATTCrrCTCrrATCTAACTAAT^ 

LKI SKrPCKROTFlKIHlL" 
* RFOKrQVRElHSLKFrYYS- 

« i i I ivATTTCACACCTCTTATT IIC1H 1 ATCTTAACAATA I I t ' I " !TO ATTAGCTCGAA 

1141 . . 1200 

AAAAACT AAACTCTCC AC AAT AAAAC AAAAT AC AATTCTT AT AAAAAACT AA TCC AC CTT 

FFI SOLLFSrrLTIFro-UE 
FSFMSCYFLLS-OYFLISWK- 
FMFTAVIFFYt-NNI FCLACS- 

CTAAAAACTATCAAATAACACAACCCCTACACTCAC^TAACTTACCTTATTCACATTCAT 

• 1260 

CA* i i i i^ATACTTTATTCTCTTCCCCATCTCACTCCATTGAATCCAATAACTCn'AACTA 

VK SIK-EJCR-TEVT - L f H I H " - 
* KVSNKRSARLR * LSLFTFI - 
K K Y 0 I R E A LOCCN L A Y S H S • - 

i ■» c , ACATCCACC ^ CATATATCC >^ TA C G ^ 

1261 ...... .... . . . .......... 1320 

TCT AGCTC^ AAC^ AT AT ACCTT ATCCT AC^ATTCCTTTCTCCTC ACT ACC C AAAATTTTT 

RSTF ^lfPlRC-CNSSHPF-K 
D R P S Y I0YOOKETAV IRFK N - 
rOLHtSNTMIRKQOSSVLKt- 

ATCACCATACTCCTCATTT AAAAATCTCACTTCTTTACCT 1 J 8 ° 

" C Y E 0 * ^FRVKKWSRNLMQK 
SAHRTKFLES R N C A £ I L t K K - 
VLCCLNF-SOEMEPKS-SKR- 

. , a , CAATTCCCTCCATATTCCAAAACAATCCAACTCTAAATCTTTCCTTAATAACTATTACCA 

CTTAACCCACCTATAACCTTTTC 1 TACCTTGACATTTACAAAGCAATTATTCATAATCCT * * 

ELRSYCKRIEL" r f r • * V L P 

NCv = I*XESNSi<SF VNKYYO- 
IASI *-0XN RTLMLSL I S t T N - 

ATCTTCA7TCATTCAACACATTCACCACCCAACTCCACACAACATCATTAAACAAATAAA 
i<4 1 - - — — ............. 

TACAACTAACTAACTTCTCTAACTCCTCCCTTCACCTCTCTTCTACTAATTTCTTTATTT 

iLrOCRO-RCNCTEOM-Rfix 
SCLiEE lOEATAQX r I X £ I X . 
LOCLKRLTROLHRR S I. X K • S - 

l5Qi CTAAc» . » lATTAATTACACAATAAACTAAATTACTAATATACACATCACCCATCTTCAA 

CATTC AAAAT AATTAATCTCTTATTTCATTTAATCATTATATCTCTACTCCCTACAACTT ' 
v T r t :l - a lN'[TN(r (SOL0 
" LCLIRE-TKLLI - RSAIFN- 
Wf**LENKLNY- Y R O O •« S S I - 

TTCACCAAATAAAACCTCAACTAAACTTACACAATAAyAAAATACAAACCTTCCTC AAAAT 
AACTCCTTTATTrrCCACTTCATTTCAATCTCTTA I f II I 1 I ATC TTTC C AAC C ACTTTT A 

*-TK'XLW-S-TtKMTWLCOW 
CRNXSCTKVRO-KIOTLVKf 
OC t XAELKLONKK Y K P W S X Y - 

ATTCACC AACCAAAAC AACACC ACTT ACC AAAAC AAAAAAT AACCC AA T AAAT AAAATC A 

1621 1680 

TAACTCC f TCC 1 I'l I CTTCTCCTC AATCC 1 I 1 1 C I I' I I I I ATTCCCTTATTTATTTTACT 

ICCCXCOQLAKEX I A Cj - * K C 
LRK£XKTS*0KKK * C * X N £ 

CCR \RRPV$XRXMXA I VKns. 
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FIGURE 12 (cont.) 



GTACAGAACTCA>CAAATAAAACATTTA 1 1 1 1 1 IT CAATAATTTATTCAAAACACCCCTT 
168L - ... . . t 

CATCTCTTCACl Itl I 1 I'A 11 I"lX'l'AAATA>tAAAAACTTATTAAATAA C 1 I UL 1 CCCCAA 

d v O:<CRMKRFIFFNNLLKRCV 

b YRSCEIKOLFFS I IYCKECF- 

C TcVKK*K I YFFO" FI EKRCF* 

TTCCCCTTTTCCCCTTTTCCCC 

1762 

AACCCCAAAACCCCAAAACCCC 

a UCrWCFC 

t> WCTCVLC* 

c C V L C F W 
FIGURE 12 (cont.)

„ a , CATATGAC ^ TCACTCA ^ CCAT CTCCATACATCACACTTACCAAACACAAACTCCCTAT 

481 - 540 

CTATACTCTTACTCACTTTCCTACACCTATCTACTCTCAATCCTTTCTCTTTCACCCATA 

H M R MSORISfHOTrOROTRY 

I C ECVKCSRYlRLTKOKLAr- 
YCME SKOLOTSOLPKTNSL • - 

_ AA ^5ff^ CAAj ^ CT "^ 

TTTTCCCTTCTTTTTCAAACTATTACCTTCTCCTC^ 600 

* T o 0EKV C-SNSRRTYCIYYS 
K R tKKFONRTAEELlArTlR- 
NARK SLI IE00KNLLHLLFV- 

TATCCCTTTTATTACAATTCTTTTACCTATCCACCCTCAACTCCCCACTCTTCACACAAT 

601 660 

ATACCCAAAATAATCTTAACAAAATCCATACCTCCCACTT^ACCCCTCACAACTCTCTTA 

YCFYYNCFRY RRCTPESCON - 
MCrtT tVLC IOCELPSLETI - 
WVLLOLF'VSTVNSRVLROU- 

661 ^^ CCTC,I ^ ACA>CTCAAMA>TCCCAC ^ 

ACTTTTTCCACAAATCTTCACTTCCTTACCCTCAACACTTTCAACACTACACATACCCTA 

C KSCLQLKESQFCKF'CVCH 
E KAVYNCRMRSSESSOVYAI - 
KKLFTTECIAVLKVLMCMPL- 

72 1 ^I'TTfJ^^^^ 

ATAAAACACTTAATTACACTTTATACAATACACTTAAATTACCTATCCATATCTTTCTTT 

Y r F , * SOlSYLMLM °SYRNIC - 

„ C - lNLXrL lSI-WlAIETN- 
FC - LI SNILSOFNC - L • K 0 T - 

7 e i 55^7*^5 CATCC ^ C ^^ TCCMTATAC 

CCTTTATTTCCTACCTTCAAATTACCTTATATCCAATTTAGCAAACCCTCTTTACCTCTC 8 ^ 0 

P "* ?C: ' :rN ClYVKSFCTNAH 
° 1 N _ " ASi -"£rTL^ PLCOMHT- 
X ' * V V w n I R • I L W O K C T L - 

a 4 1 75.^777*7." T ^ CATTC ^ A ^ CCATACA ^ 

ACTTAAATATAACC ~ AAC^ATTTCCTATCTATCTCTCTTACCAAATCTCTCACTAAATCC ' ° ° 

C fr !YiCrr -*HRYTECFRDCFS 
C 1 - OS • S IOTQHAL ETO LA . 

N L Y * : L 5 A • I H R H L • R L I • L - 

9 o i 77^^5 A ?^ 7TACCTCT ^ 

AATCTTCTCT AATCC AC AAAACTAATCACAACC ACTAC AC AATAT AC AAATTOCTTCCT ' & ° 

*- 00ir C.-0Y5CSSLtSLXCA 
T " V: - ' ~ I* AH L L Y L * K K Q 

'-^CCLLLISY T F K R S R - 

CCCCTTTACTTTTCTTCTCATTTCTTTCT^ 1 ° 2 ° 

C C rt : * : * fl *- < »! E 1 S K F V O S S V T 

AXC *£O*RKRF0NLL fLL-p. 
* N£ * *^ *E»OFKICCFFCNR. 

10 2 1 C 5 AATTAAC AAC AACAA T ATTACCAACCAAAAACAACAAC ACCTATC ACAATCCTCATTC 

CCTTAATTCTrCTTCTT.-.7A.ATCCTTCC I I I I ' I C I T CTTCTCCATAGTCTTAGCACTAAC 1 ^ 

C ! N -< « 5 N £ k £ £ £ L S 0 S C F 

£ '- TTR I . A T K KXKSYHNPQS 
* * O ■? E i 0 ? K R 3 R A I T I L T L * 
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FIGURE 9 (cont.) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA
2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC
2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA
3001 CCTTGTCTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG
3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG
mTRT.adj / hTRT DNA alignment (formatted alignment)



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



AGAGCACCCT CTCCCTGAGG AGATCCTOGC OIAGTTCCTG CACTGGCTGA 1700 
AGAGCACCGT CTGCGTGAGG AGATCCTCGC CAAQTTCCTG CACTGGCTGA 1700 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



TGAGTCTCTA COTCOTCCAC CTCCTCACCT CTTTCTTTTA TOTCACCGAG 1750 
rTCAJOTGTCTA CGTCGTCGAG CTGCTCAGGT CTTTCTTTTA TGTCACGGAC 17S0 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



ACCACGTTTC AAAAGAACAG CCTCTTTTTC TACCGGAAGA CTCTCTGGAG 1800 
ACCACGTTTC AAAAGAACAG GCTCTTTTTC TACCGGAAGA OTOTCTGGAG 1800 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



CAAGTTGCAA AGCATTGGAA TCAGACAGCA CTTGAAGAGG GTGGAGCTGC 
CAAGTTGCAA AGCATTGGAA TCAGACAGCA CTTGAAGAGG GTGCACCTGC 
GGGAGCTGTC GGAAGCAGAG 



GGGAGCTGTC GGAAGCAGAG GTS 




3Gcg^«^bAc£a — Eh 

T3jS =\C"I 3C TAC £ V 

*MK3crAG§rt 



1850 

1850 

1897 
28 

1900 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus





hTCP1.2.gw
mTRT.1.adj.gw

Consensus




2000 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



^iSB AQ 9 ^^.-^^^^J^-^rCGJ^ GOTGdSSySK 

ccsAG^qsc ccagcatttc /c2c:^cSrcSfl^ M*£v 

Rjy^ff^x: ccagcatttc pfecflte^fe^^xacGAG ccrcaSg;^ 



2035 
163 

2050 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



gr:irrcAG|C3 
nres 

ITTCAGC R 



Z GAGC " GG 2G 



PG 



TGGffXSAKETA'k 
K3CTC&\CTA^ 

n?GCTCAACTA:: pj AGCqGGpG CGGCRyrn fe.tR Y^frC ^ ySG 2100 



CGGCGCCC £G'*- 
AATTCGPi ^T*tr£hj4; 



2085 
205 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



13 CTGGGCCTGG *CGj^lCC\ CAGGGCCTGG CGCACCTTCC 

— IdHTWTr CXjj&FA - Sc - - 

NiiVrtw;*^ -si's 

CGq^tllFBP CTGGGCCTGG J CGaiCfit l Km CAGGGCCTGG CGCACCTTCG 



2135 
217 

2150 



hTCP1.2.gw
mTRT.1.adj.gw

Consensus



TGCTGCGTGT GCGG GC : 3* G : : GC 



fcTGTA CTTTGTCAAC 
C— — — — 

pTGTA CTTTGTCAAG 2200 



2185 
239 
FIGURE 58 (cont.)



140 150
trp gly leu leu leu arg arg val gly asp asp val leu val his
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC

160
leu leu ala arg cys ala leu phe val leu val ala pro ser cys
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC

170 180
ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC

190
thr gln ala arg pro pro pro his ala ser gly pro arg arg arg
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT

200 210
leu gly cys glu arg ala trp asn his ser val arg glu ala gly
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG

220
val pro leu gly leu pro ala pro gly ala arg arg arg gly gly
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC

230 240
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC

250
ala ala pro glu pro glu arg thr pro val gly gln gly ser trp
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG

260 270
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT

280
val val ser pro ala arg pro ala glu glu ala thr ser leu glu
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG
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FIGURE 58 (cont.) 



290 300
gly ala leu ser gly thr arg his ser his pro ser val gly arg
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC

310
gln his his ala gly pro pro ser thr ser arg pro pro arg pro
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC

320 330
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC

, 340 

dc TAC TCC TCA & S? AAG Iag" SS cS £ S TCC & 
OS He TCT l ACI IS He & £ & Si? S S £ C?G 
^ - - - - £ TCC £ E ^ ATG CCA £ £ ?CC° 
£ S TTG IS S? C^ £ SS £? £ S SS S l £ 

™ ss & Iag & & S2 i & n & s? - is 

410 420
val leu leu lys thr his cys pro leu arg ala ala val thr pro
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA

430
ala ala gly val cys ala arg glu lys pro gln gly ser val ala
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG
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FIGURE 58 (cont.) 



440 450
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGG CTG GTG CAG CTG

460
leu arg gln his ser ser pro trp gln val tyr gly phe val arg
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG

470 480
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG

490
his asn glu arg arg phe leu arg asn thr lys lys phe ile ser
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC

500 510
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG

520
met ser val arg asp cys ala trp leu arg arg ser pro gly val
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT

530 540
gly cys val pro ala ala glu his arg leu arg glu glu ile leu
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG

550
ala lys phe leu his trp leu met ser val tyr val val glu leu
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG

560 570
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC

580
arg leu phe phe tyr arg pro ser val trp ser lys leu gln ser
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC

590 600
ile gly ile arg gln his leu lys arg val gln leu arg glu leu
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG
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FIGURE 58 (cont.) 

610
ser glu ala glu val arg gln his arg glu ala arg pro ala leu
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG

620 630
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG

640
pro ile val asn met asp tyr val val gly ala arg thr phe arg
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC

650 660
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG

670
phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTT

680 690
gly ala ser val leu gly leu asp asp ile his arg ala trp arg
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC

700
thr phe val leu arg val arg ala gln asp pro pro pro glu leu
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG

710 720
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC

730
gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG

740 750
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC

760
his gly his val arg lys ala phe lys ser his val ser thr leu
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG
FIGURE 58 (cont.)

770 

cnr asp leu gin pro cyr met arg cln -h- vai - i a hl ~ 80 - 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG C~CT CAC CTG 

gxu tnr ser pro leu arg asp ala vai i i - r-i, t 

GAG ACC AGC CCG CTG AcJ GA? GCC GTC GTC ATC f Ag" l£ aSc 

900 

^ ™ f S S glu ala ser Sly leu ohe aso vai one 

TCC C:G AAT GAG GCC AGC AGT GGC CTC TTC GAC G7C~ TTC CTA CgI 

820 

pne mat cys his his ala vai arg ile a^-a alv - 

TTC ATG TGC CAC CAC GCC GTG C G ? ATC IcJ ?gC «l TCC TAC GTC 

830 

?ip f£ ^Z* gly lle P ro srln gly ser He ieu s-r — 
v-AG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC xrr. 



840 

i eu i eu 
"CC ACG CTG CTC 



52 - CTG £ taI & SS ATG & £ CTG £ £ GGG 

360 

^ ^ leu leu arg Leu vai aso asp phe leu 

Cv - G ^ CTG C ^C CTG CGT TTC- GTG GAT GAT TTC TTG 

T?G G?~ Sr? ™ 1SU Chr hlS lys chr leu arg thr 

TTC ^ mCA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

390 

leu vai arg gly vai pro glu cvr gly C v, vai ™i * ?°° 
CTG GTC CGA GGT GTC CCT Lg TAT g£ £c GTG CTG S£ TTG SS 

A^G ™ j£i 1^ aSn Phe pro val glu aia ieu gly gly 

~ CA ^ C TTC CCT GTA GAA GAC GAG GCC CTG Sct 2g£ 

920 

^ S he V21 mec P ro his aly ieu phe oro ^ro cvs 

ACG TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 
FIGURE 58 (cont.)



940
gly leu leu leu asp thr arg thr leu glu val gln ser asp tyr
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC

950 960
ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC

970
arg gly phe lys ala gly arg asn met arg arg lys leu phe gly
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG

980 990
val leu arg leu lys cys his ser leu phe leu asp leu gln val
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTC

1000
asn ser leu gln thr val cys thr asn ile tyr lys ile leu leu
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG

1010 1020
leu gln ala tyr arg phe his ala cys val leu gln leu pro phe
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT

1030
his gln gln val trp lys asn pro thr phe phe leu arg val ile
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC

1040 1050
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC

1060
ala gly met ser leu gly ala lys gly ala ala gly pro leu pro
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC

1070 1080
ser glu ala val gln trp leu cys his gln ala phe leu leu lys
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG

1090
leu thr arg his arg val thr tyr val pro leu leu gly ser leu
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC
FIGURE 58 (cont.)



1100 1110
arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG

1120
leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC

1130 1132
phe lys thr ile leu asp OP
TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAA

AAAAAAAAAA
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(figure 60, SEQ ID NO: 124; page 2 of 2) 

iS! ^ GGCCCAGCA1TTCACCCAGCG TCTCAAGACTCTCTTCAGCATGCTCAAC 
2001 TATGAGCCGACAAAACATCCTCACCTTATGGGGTCTTCTGTA^CTGGGTAT 
2051 GAATGACATCTACAGGACCTGGCGGGCCTTTGTGCTGCGTGTGCGTGCTC 

2101 tggaccanacaccangatgtactttgttaaggcagatgtgaccggSot 

ATGATGCCATCCCCAGGGTAAGCTGGGTGNNNNNNNNNNNNNNNNNNNNN 
NNNNNNNNNNNNNNNNNNNNTGGTGGAGGTTGTTGCCAATATGATCAGG^ 



2251 ATTCGGAGAGCACGTACTGTATCCGCCAGTATGCACTGGT^ 

2301 AGCCAAGGCCAAGTCCACAAGTCNTTTAGGAGACAGGTCACCACCnTCTC 

™J ™ OTCCAGCCATAC ^^ 

240 ATGCCAGTGCACTGAGGAACTCCGTTGTCATCGAGCAGAGCATCTCT^TG 

245 1 AATGAGAGCAGCAGCAGCCTGTTTGATTCTTCCTGCACTTCCTGCGTCAC 

2501 AG ^ G ^ GG ^AAAGATTGGTGACAGGTGCTATACGCAGTGCCAGGGCA^^CC 

^ CC ^ GGGCTCCAGCCT ^^ 

2601 TGGAGAACAAGCTGTTTGCTGAGGTGCAGCGGGATGGGTTGC^ACGT 

2651 TTTGTTGATGACTTTCTGTTGGTGACGCCTCACTTGGACCAAGC^OCXJO^ 

2701 

2901 GCCCAGACCTCAATTAAGACGAGCCTCACCTTCCAGAGTGTCT 



2751 XXXXXXXXXXXXXXXXXXXXXXXXXXXX~xTx35^^^ 
2851 



2801 



XXXX 



"TCACCTTCCAGAGTGTCTTCAAA 
, CTGTCr,r»TrTTr,rr.r.TTn a a^/- 

3001 ACGGTCTATTTCTAGACTTGCAGGT 



• 



SEQ ID NO:122/123: (page 2 of 7 for Fig. 59: SEQ ID NO: 122/123)

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his

TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 
160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 

CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala

GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 
190 

thr gln ala arg pro pro pro his ala ser gly pro arg arg arg

ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 

CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 
220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 

GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 

AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 
250 

ala ala pro glu pro glu arg thr pro val gly gln gly ser trp

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 

GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 
280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 

GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 



SEQ ID NO:122/123: (page 3 of 7 for Fig. 59: SEQ ID NO: 122/123) 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 

GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 
310 

gln his his ala gly pro pro ser thr ser arg pro pro arg pro

CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 

TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 
340 

leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu

CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 

CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 
370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 

GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro

CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 
400 

leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly

CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 

GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 
430 

ala ala gly val cys ala arg glu lys pro gln gly ser val ala

GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 



SEQ ID NO: 122/123: (page 4 of 7 for Fig. 59: SEQ ID NO: 122/123) 

440 450 
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu

GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 
460 

leu arg gln his ser ser pro trp gln val tyr gly phe val arg

CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 

GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 
490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys

CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 
520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 

ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 

GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 
550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 

GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn

CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 
580 

arg leu phe phe tyr arg lys ser val trp ser lys leu gln ser

AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC 



SEQ ID NO:122/123: (page 5 of 7 for Fig. 59: SEQ ID NO: 122/123)

590 600 
ile gly ile arg gln his leu lys arg val gln leu arg glu leu

ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
610 

ser glu ala glu val arg gln his arg glu ala arg pro ala leu

TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 

CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 
640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 

CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 

AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 
670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 

TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 

GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 
700 

thr phe val leu arg val arg ala gln asp pro pro pro glu leu

ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 

TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 
730 

gln asp arg leu thr glu val ile ala ser ile ile lys pro gln

CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 



SEQ ID NO:122/123: (page 6 of 7 for Fig. 59: SEQ ID NO: 122/123) 



740 750 
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala

AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 
760 

his gly his val arg lys ala phe lys ser his val leu arg pro 

CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 

GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 
790 

gln pro val leu arg arg his gly glu gln ala val cys gly asp

CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA 

TTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGT 

CCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTT 

CCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGG 

CCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGA 

CTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCTT 

CAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTC 

ACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 

ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCAT 

TTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGG 

CCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCA 

AGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCA 

TTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTC 

AGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCT 

GGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTGAT 

GGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGG 

CTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGT 

CTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGT 

CCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCC 

GTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCA 



SEQ ID NO:122/123: (page 7 of 7 for Fig. 59: SEQ ID NO: 122/123) 

CCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATT 

GTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCC 

TGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTAC 

ACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGG 

TGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAA 

Translated Mol. Weight = 90239.69 
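
Each amino acid row in Fig. 59 sits above the codon row that encodes it, so a row can be checked by translating its triplets with the standard genetic code. The following is a minimal sketch of such a check, not part of the original figure. The function name translate_row and the "???" placeholder for unreadable or ambiguous codons are illustrative assumptions. The stop codon labels follow the figure's own use of "OP" (opal) for TGA; "OC" (ochre) and "AM" (amber) are the conventional names for TAA and TAG. The sketch does not attempt to reproduce the molecular weight value stated above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_ONE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Lower-case three-letter names, as used in the figure's amino acid rows.
ONE_TO_THREE = {
    "A": "ala", "R": "arg", "N": "asn", "D": "asp", "C": "cys",
    "Q": "gln", "E": "glu", "G": "gly", "H": "his", "I": "ile",
    "L": "leu", "K": "lys", "M": "met", "F": "phe", "P": "pro",
    "S": "ser", "T": "thr", "W": "trp", "Y": "tyr", "V": "val",
}

# Stop codon labels; the figure writes TGA as "OP".
STOP_NAMES = {"TAA": "OC", "TAG": "AM", "TGA": "OP"}


def translate_row(codon_row: str) -> str:
    """Translate a space-separated row of codons into three-letter names.

    Hypothetical helper: codons that are not valid A/C/G/T triplets
    (for example ambiguity codes or OCR damage) are reported as "???".
    """
    names = []
    for codon in codon_row.upper().split():
        if codon in STOP_NAMES:
            names.append(STOP_NAMES[codon])
        elif codon in CODON_TO_ONE:
            names.append(ONE_TO_THREE[CODON_TO_ONE[codon]])
        else:
            names.append("???")
    return " ".join(names)


if __name__ == "__main__":
    # First codon row of Fig. 59, page 2 (residues around 140-150).
    print(translate_row(
        "TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC"
    ))
    # prints: trp gly leu leu leu arg arg val gly asp asp val leu val his
```

Comparing the printed row with the amino acid row in the figure flags transcription errors in either line; a "???" entry marks a codon that needs to be re-read from the source sheet.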



